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Methods for the Diagnosis and Prognosis of Acute 

Leukemias 

Background of the Invention 

Field of the Invention 

The present invention relates to methods of classifying acute leukemias. 
More particularly, the invention relates to methods of distinguishing acute 
myeloid leukemia (AML) from acute lymphoblastic leukemia (ALL) by 
measuring the nucleic acid levels or gene product (protein) levels of small 
combinations (two, three or more) of particular human genes. The invention is 
also useful as a prognostic indicator in AML. 

Related Art 

A major challenge of cancer treatment has been to target specific therapies 
to pathogenically distinct tumor types, to maximize efficacy and minimize 
toxicity. Improvements in cancer classification have thus been central to 
advances in cancer treatment. 

Cancer classification has been based primarily on morphological 
appearance of the tumor, but this has serious limitations. Tumors with similar 
histopathological appearance can follow significantly different clinical courses 
and show different responses to therapy. In a few cases, such clinical 
heterogeneity has been explained by dividing morphologically similar tumors 
into subtypes with distinct pathogeneses. Key examples include the subdivision 
of acute leukemias, non-Hodgkin's lymphomas, and of childhood "small round 
blue cell tumors" into neuroblastomas, rhabdomyosarcoma, Ewing's sarcoma, and 
other types. For many more tumors, however, important subclasses are likely to 
exist but have yet to be defined by molecular markers. For example, prostate 
cancers of identical grade can have widely variable clinical courses, from 
indolence over decades to explosive growth causing rapid patient death. 
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Cancer classification has been difficult in part because it has historically 
relied on specific biological insights, rather than systematic and unbiased 
approaches for recognizing tumor subtypes. 

Acute leukemia is a disease of the leukocytes and their precursors. It is 
characterized by the appearance *of immature, abnormal cells in the bone marrow 
and peripheral blood and frequently in the liver, spleen, lymph nodes, and other 
parenchymatous organs. The clinical picture is marked by the effects of anemia, 
which is usually severe (fatigue, malaise), an absence of functioning granulocytes 
(proneness to infection and inflammation), and thrombocytopenia (hemorrhagic 
diathesis). The spleen and liver usually are moderately enlarged, while enlarged 
lymph nodes are seen mainly in the pediatric lymphoblastic leukemias. Fever and 
a very high ESR complete the picture. Leukocyte counts vary greatly in the acute 
leukemias. About one-fourth to one-third of cases begin with a low white blood 
count (sub- or aleukemic leukemia), while about half show some degree of 
leukocytosis. Mature granulocytes may still be found in the peripheral blood in 
addition to abnormal forms. The coexistence of immature and mature cell forms 
is termed "hiatus leucaemicus." The leukocytopenic forms are the most difficult 
to differentiate from aplastic anemias, pancytopenias, and the myelodysplastic 
syndromes. Bone marrow aspiration is usually necessary to establish a diagnosis. 
Aspirated marrow is found to be permeated by abnormal cells (paramyeloblasts, 
paraleukoblasts, nonclassifiable cells (N.C), leukemic cells, blasts, etc.) with 
little or no evidence of normal hematopoiesis. 

The acute leukemias have traditionally been classified according to 
morphologic, cytochemical, and/or immunologic criteria. An overview of acute 
leukemia classification can be found in the "Atlas of Acute Leukemia" available 
on the world wide web at www.meds.com/leukemia/atlas/acute-leukemia.html. 

As a brief historical review, the classification of acute leukemias began 
with the observation of variability in clinical outcome (Farber, S., et aL, N. Engl 
J. Med. 238:1%1 (1948)) and subtle differences in nuclear morphology (Forkner, 
C.E., Leukemia and Allied Disorders , MacMillan, New York (1 938); Frei, E., et 
at, Blood 75:431 (1961); Medical Research Council, Br. Med. J. 1:1 (1963)). 
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Enzyme-based histochemical analysis were introduced in the 1960s to 
demonstrate that some leukemias were periodic acid-Schiff positive, whereas 
others were myeloperoxidase positive (Quaglino, D., and Hayhoe, F.G.J., J. 
Pathol 78:521 (1959); Bennett, J.M., Dutcher, T.F., Blood 33:341 (1969); 
5 Graham, R.C., et al, J. Histochem, Cytochem 75:150(1 965)). This provided the 

first basis for classification of acute leukemias into those arising from lymphoid 
precursors (acute lymphoblastic leukemia, ALL) or from myeloid precursors 
(acute myeloid leukemia, AML). This classification was further solidified by the 
development in the 1970s of antibodies recognizing either lymphoid or myeloid 

10 cell surface molecules (Tsukimoto, I., et al, N. Eng. J. Med. 294:245 (1976); 

Schlossman,S.F.,e/ a/.. Proc. Natl. Acad. ScL U.S.A. 73: 1288 (1976); Roper, M., 
etal, Blood 67:830(1983); Sallan, B.S.E., et al 9 Blood 55:395 (1980); Pesando, 
J.M., etal, Blood 54: 1240 (1979)). Most recently, particular subtypes of acute 
leukemia have been found to be associated with specific chromosomal 

15 translocations — for example, the t(12;21)(pl3;q22) translocation occurs in 25% 
of patients with ALL, whereas the t(8;21)(q22;q22) occurs in 15% of patients 
with AML (Golub, T.R., et al, Proc. Natl Acad. Set. U.S.A. 92:4911 (1995); 
McLean, T.W., et al, Blood 88:4252 (1996); Shurtleff, S.A., et al, Leukemia 
9:1985 (1995); Romana, S.P., et al, Blood 86:4263 (1995); Rowley, J.D., Ann. 

20 Genet. 16: 109 (1973)). 

Although the distinction between AML and ALL has been well- 
established, no single test is currently sufficient to establish the diagnosis. 
Rather, current clinical practice involves an experienced hematopathologist's 
interpretation of the tumor's morphology, histochemistry, immunophenotyping, 

25 and cytogenetic analysis, each performed in a separate, highly specialized 

laboratory. Although usually accurate, leukemia classification remains imperfect 
and errors do occur. 

Distinguishing ALL from AML is critical for successful treatment; 
chemotherapy regimens for ALL generally contain corticosteroids, vincristine, 

30 methotrexate, and L-asparaginase, whereas most AML regimens rely on a 

backbone of daunonibicin and cytarabine (Pui, C.H., and Evans, W.E., N. Engl 
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J. Med. 339:605 (1998); Bishop, J.F., Med. J. Aust. 770:39 (1999); Stone, R.M. 
andMayer > R.J.,«ewato/. Oncol. Clin.N.Am. 7:47(1993)). Although remission 
can be achieved using ALL therapy for AML (and vice versa), cure rates are 
markedly diminished, and unwarranted toxicities are encountered. 

Recently, Golub, T.R., et al, Science 286: 531-537 (Octoberl 999), have 
reported on a cancer classification scheme for AML and ALL based on the gene 
expression monitoring of 50 human genes. Although the 50-gene predictor 
approach for diagnosing AML versus ALL fared well in validation studies, the 
Golub et al. report noted that the average prediction strength was lower for 
samples from a different laboratory, thus emphasizing the importance of 
f standardizing sample preparation. Further, the application of 50 genes for AML- 
• ALL class distinction may not be desirable for a clinical setting. A method/tool 
employing fewer indicator genes/gene products than used by Golub et al. would 
provide increased ease, increased speed, and reduced cost. Potential for human 
error (misidentification) could be reduced. Reliance on expert, trained 
interpretation of data could also be reduced. Rapid diagnosis based on the non- 
random correlations ("diagnostic signatures" or "fingerprints") according to the 
invention described below thus would produce enormous benefit. Clearly, there 
is a continued need for simpler and less costly objective cancer classification 
approaches, especially for the classification of acute leukemias. 

Summary of the Invention 

The inventors have discovered that measuring the levels of small 
combinations (two, three or more) of particular human genes (in terms of nucleic 
acid or protein levels) can be used to distinguish AML from ALL. Accordingly, 
the present invention overcomes the disadvantages of the prior art by providing 
a method for diagnosing leukemia by measuring the levels of a lesser number of 
genes than provided in the art. 

The invention also provides a preferred embodiment of the foregoing 
method wherein the human genes used to diagnose are LYN V-yes-1 Yamaguchi 
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sarcoma viral reJat^S^ndQgene homolog, PPGB Protective protein for beta- 
galactosidase,\md Zyxin. / 

In the m^st-pr^rred embodiment of the foregoing method, the genes 
used to diagnose are: leukotriene C4 synthase (LTC4S) gene and Zyxin. 

The invention also provides a very particularly preferred embodiment of 
the foregoing methods, wherein the level of gene expression is measured using 
a DNA microchip. 

The present invention also provides an embodiment, whereby the 
measurement of at least two human genes is used as a prognostic indicator of 
AML. 

The present invention also provides a kit for diagnosis or prognosis of 
leukemia 

The invention also relates to therapies targeted at the indicator genes 
described herein, as well as the screening of drugs for cancer that target these 
indicator genes or their protein products. 

It is to be understood that both the foregoing general description and the 
following detailed description are exemplary and explanatory and are intended 
to provide further explanation of the invention as claimed. 

Detailed Description of the Preferred Embodiments 

The inventors have discovered that measurement of the levels of only a 
few human genes (nucleic acid levels or protein levels) can be used to distinguish 
AML from ALL. By "nucleic acid" is intended RNA or DNA, preferably 
mRNA or cDNA derived therefrom. Accordingly, the present invention 
overcomes the disadvantages of the prior art such as Golub et al (1999), supra, 
by providing a method for diagnosing and classifying acute leukemia by 
measuring the expression levels of a lesser number of genes or gene products. 

The names of the genes useful in diagnosis and/or prognosis described 
herein are as designated by Affymetrix and Golub et aL f and, according to them, 
correspond, as indicated in Appendix B, to particular GenBank entries. 
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The invention also provides a preferred embodiment of the foregoing 
method wherein the human genes used to diagnose are: LYN V-yes- 1 Yamaguchi 
sarcoma viral related oncogene homolog, PPGB Protective protein for beta- 
galactosidase,andZyxin. These gene names are as assigned by Affymetrix and 
Golub et al., and according to them, correspond to GenBank Accession Nos. 
M16038_at, M22960_at, and X95735_at, respectively. 

In the most preferred embodiment of the foregoing method, the genes 
used to diagnose are: leukotriene C4 synthase (LTC4S) gene and Zyxin. These 
gene names are as assigned by Affymetrix and Golub et ai, according to them, 
correspond to GenBank Accession Nos. U50136_mal_at, and X95735_at, 
respectively. 

Other embodiments employ other csets which are identified in Appendix 

A. 

It is expected that, for certain csets, an inverse pattern of gene expression 
of ALL markers, as disclosed herein, would correlate with AML diagnosis. 
Likewise, an inverse pattern of gene expression of AML markers, as disclosed 
herein, would correlate with ALL diagnosis. 

The invention also provides a very particularly preferred embodiment of 
the foregoing methods, wherein the level of gene expression is measured using 
a DNA microchip. 

The present invention also provides an embodiment, whereby the 
measurement of small combinations (two, three or more) of particular human 
genes is used as a prognostic indicator of AML. 

The present invention also provides a kit for diagnosis or prognosis of 
leukemia. 

Gene expression data from the database 
http://waldo.wi.mit.edu/MPR/data_set_ALL_AML.html (which was made 
publicly available on October 15, 1999) was analyzed as described below. Per 
Golub et at., Science 286: 531-537 (Oct 15 1999), incorporated herein by 
reference, the database contains the levels of expression of each of 7 1 29 genes for 
each of 72 leukemia samples, which levels were determined using Affymetrix 
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genechip technology. The samples were classified by Golub al as either acute 
myeloid leukemia (AML) or acute lymphoblastic leukemia (ALL) and this 
information is also included in the database. The database further includes 
clinical data on 15 individual acute myeloid leukemia (AML) samples, with 
respect to treatment success or failure. 

The present inventors set out to detect signal(s) from the noise in the huge 
data set, Le., to identify previously unrecognized correlated gene expression 
levels of groups of genes. To this end, the raw gene expression data was used in 
that form or processed using a standard data normalization technique (linear 
transformation followed by logarithm). Next, the expression levels for each gene 
were subjected to one of two standard data clustering techniques ("K means" as 
practiced by those skilled in the art or "Mutual nearest neighbors" as described 
inJarvis,R.A. and Patrick, E. A., IEE Trans. Computers C-22: 1025- 1034 (1973)). 
Such pre-processing made the subsequent identification of correlations more 
convenient. "Clustering", as it is commonly held in the art, refers to methods for 
grouping "objects" of a system based on some similarity measure. The set of 
values in the system being analyzed is replaced by another, smaller set of values 
in a way that reflects the original distribution according to a chosen distance 
metric. In effect, clustering forces objects into likely groups. Here, the objects 
were the various experimentally determined levels of expression of a particular 
gene. The clustering algorithm provided grouping of the expression level for 
each gene into classes, as set forth in Appendix A. For example, referring to line 
3 of Appendix A (cset 2), experimentally determined expression levels of gene 
1 745 may be grouped into low (A, mean = 429.4) and high (B, mean = 221 1 .2). 
In contrast, the grouping of expression levels for gene 3320, line 1 (csetl) was 
into three classes, low (A, mean = 923.6), medium (B, mean = 2405.8), and high 
(C, mean = 3496.8). (See Appendix B for the Aflymetrix and Golub et al 
assigned name corresponding to the gene numbers employed herein. For 
example, gene 1745 corresponds to Affymetrix and Golub et al name LYN V- 
yes-1 Yamaguchi sarcoma viral related oncogene homolog). 
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Next, the pre-processed data was subjected to a variant of the 
"coincidence detection" method described in International Patent Publication No. 
WO 98/43182, published October 1, 1998 (incorporated herein by reference). 
This method provides the identification of features which are sets of attributes 
(values) that co-occur more often than by random assortment and, accordingly, 
the identification of inherent, often unexpected features of a system. Unlike other 
approaches to such identification, the number of members of the identified set is 
not chosen prior to application of the method. That is, some approaches seek 
correlations between paireof attributes (binary or 2-ary correlations). Instead, the 
coincidence detection method does not impose that k ( as in fc-ary correlations) 
be any specific number. Rather, the patterns inherent in the system are 
uncovered. As employed herein, "objects" were samples and "attributes" were 
gene expression values for particular genes, the ALL versus AML diagnosis , and 
treatment outcome for some AML samples. The high-order correlations 
("coincidence sets" or "csets") discovered by the coincidence detection method 
were further filtered and sorted by application of another correlation test. 
Matthews correlation (also known as "Four-point Correlation") is a standard, 
known, though less commonly-used variant of the standard Pearson correlation 
measure, especially suited for discrete (as opposed to continuous) data. In this 
case, a Matthews correlation was calculated between (1) particular correlated 
gene expression values, considered together for the k genes in the particular cset 
and (2) the attribute corresponding to AML or ALL diagnosis, and the csets were 
sorted from highest to lowest Matthews correlation. These Matthews-tagged 
csets may be interpreted as "rules" relating particular genes and their expression- 
value ranges to diagnosis or prognosis. A plausible English interpretation of such 
a discovered rule (see second cset in appendix A) might be, for example, 

" Gene 1 745 has expression level A (LOW relative to a control that is, 
value closest to the calculated cluster mean of 429 for this gene in one analysis 
performed and described herein) AND Gene 1829 has value B (LOW relative to 
a control) AND Gene 4847 has value A (LOW relative to a control) IF AND 
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ONLY IF the patient has leukemia type ALL (with probability based on Matthews 
correlation of 0.9077). " 

Appendix A shows csets obtained from clustered raw data and from 
clustered log normalized data. Where the same cset appears more than once in 
Appendix A, this derives from results of multiple experimental runs (different 
clustering techniques). 

Thus, using these techniques, the present inventors discovered small 
combinations of genes that provide a diagnostic indication of acute leukemia 
subtype. In addition, they also discovered small combinations of genes that 
provide a prognostic indication for AML. 

As these results indicate dependence of leukemia subtype on clustered 
gene expression levels, they are also indicative of dependence of the subtype on 
unclustered (or raw) gene expression levels. This latter relationship was 
confirmed by the present inventors using supervised learning techniques 
(artificial neural networks, decision trees, etc.) as known by those skilled in the 
art and as described in Mitchell, T.B., in: Machine Learning, chapters 3 and 4, 
McGraw-Hill (1997). The expression levels, for the genes discovered by the 
coincidence detection method, were given (in raw form, that is, unnormalized and 
unclustered) to the supervised learning agent and the subtype of leukemia (AML 
versus ALL) was predicted. The training of a neural network, and the use of a 
trained neural network for prediction or classification, is well known to those 
skilled in the art. 

Genes correlated with specific disease subtypes are likely to have a 
specific role in the disease condition, and hence are valuable targets for new 
therapeutics. 

Genes correlated with disease prognosis are likely to have a specific role 
in the disease condition, and hence are valuable targets for new therapeutics. 
Accordingly, the invention provides methods of screening for drugs that 
modulate (enhance or "inhibit) expression of genes in the csets, or modulate 
(enhance or inhibit) the activity of products of such genes. 
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For example, screening methods for identifying compounds capable of 
treating acute leukemia include contacting cells with the candidate compound, 
measuring gene expression, and comparing the gene expression of a particular 
cset to a standard expression of a particular cset, the standard being assayed when 
contact is made in absence of the candidate compound; whereby, a difference in 
gene expression indicated that the compound may be useful for treating particular 
subtypes of acute leukemia. 

High-order correlated genes are likely to play a synergistic or antagonistic 
role in the disease condition, and are likely to reveal important pathways involved 
in the disease process. 

Certain tissues in mammals with leukemia express enhanced and/or 
diminished levels of certain proteins and mRNA when compared to a 
corresponding "standard" mammal, i.e. , a mammal of the same species not having 
the leukemia. Further, it is believed that enhanced levels of certain proteins and 
mRNA can be detected in certain body fluids (eg. , sera, plasma, urine, and spinal 
fluid) from mammals with leukemia when compared to body fluids from 
mammals of the same species not having the leukemia. Thus, the invention 
provides a diagnostic method useful during leukemia diagnosis, which involves 
assaying the expression level of a gene or set of genes in mammalian cells or 
body fluid and comparing the gene expression level with a standard gene 
expression level, whereby a difference in the gene expression level over the 
standard is indicative of a specific type of leukemia. In the working examples 
disclosed herein, comparison was made between ALL and AML samples. 

Where a leukemia diagnosis has already been made according to 
conventional methods, the present invention is useful for confirmation thereof 
and as a prognostic indicator, where patients exhibiting differing gene expression 
will experience a better or worse clinical outcome relative to other patients. 

By "assaying the level of the gene expression" is intended qualitatively 
or quantitatively measuring or estimating the level of the protein or the level of 
the mRNA encoding the protein in a first biological sample either directly (eg., 
by determining or estimating absolute protein level or mRNA level) or relatively ' 
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by comparing to the protein level or mRNA level in a second biological 
sample). 

Preferably, the protein level or mRNA level in the first biological sample 
is measured or estimated and compared to a standard protein level or mRNA level 
{e.g., ALL sample v. AML sample), the standard being taken from a second 
biological sample obtained from an individual not having that leukemia. As will 
be appreciated in the art, once a standard protein level or mRNA level is known, 
it can be used repeatedly as a standard for comparison* 

By "biological sample" is intended any biological sample obtained from 
an individual, cell line, tissue culture, or other source which contains protein or 
mRNA. Biological samples include mammalian body fluids (such as sera, 
plasma, urine, synovial fluid and spinal fluid) which contain secreted mature 
protein, and ovarian, prostate, heart, placenta, pancreas liver, spleen, lung, breast 
and umbilical tissue. 

The present invention is useful for detecting acute leukemia in mammals. 
Preferred mammals include monkeys, apes, cats, dogs, cows, pigs, horses, rabbits 
and humans. Particularly preferred are humans. 

In order to detect gene expression, total cellular RNA can be isolated from 
a biological sample using the single-step guanidinium-thiocyanate-phenol- 
chloroform method described in Chomczynski and Sacchi, Anal Biochem. 
762:156-159(1987). Levels of mRNA encoding the protein (or cDNA prepared 
from such mRNA) are then assayed using any appropriate method. These include 
Northern blot analysis (Harada et ai, Cell (53:303-312 (1990)), SI nuclease 
mapping (Fujita et al, Cell 49:351- 367 (1987)), the polymerase chain reaction 
(PCR), reverse transcription in combination with the polymerase chain reaction 
(RT-PCR)(Makinoera/., Technique 2:295-301 (1990)), and reverse transcription 
in combination with the ligase chain reaction (RT-LCR). 

Protein levels may be determined by assaying enzymatic activity of the 
protein. This is especially useful when screening potentially useful therapeutic 
drugs that affect protein activity. 
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Assaying protein levels in a biological sample can also be performed 
using antibody-based techniques. For example, protein expression in tissues can 
be studied with classical immunohistological methods (Jalkanen, M., et al t J. 
Cell Biol 707:976-985 (1985); Jalkanen, M, etal, J. Cell, Biol 705:3087-3096 
(1987)). This is useful when screening drugs as potential therapeutics that affect 
gene expression. 

Other antibody-based methods useful for detecting protein gene 
expression include immunoassays, such as the enzyme linked immunosorbent 
assay (ELISA) and the radioimmunoassay (RIA). 

Suitable labels are known in the art and include enzyme labels, such as, 
glucose oxidase, horseradish peroxidase and alkaline phosphatase; radioisotopes, 
such as iodine ( l25 I, ,2, I), carbon ( M Q, sulfur ( 35 S), tritium ( 3 H), indium ( ,,2 In), 
and technetium ( 99m Tc); fluorescent labels, such as fluorescein and rhodamine; 
and biotin. 

In a preferred embodiment, gene expression is measured using a DNA 
microchip, as described below in Example 3. DNA microchips are described in 
U.S. Patent Nos. 5,744,305; 5,424,186; 5,412,087; 5,489,678; 5,889,165; 
5,753,788; and 5,744,101; and WO 98/12559; and Harris, Exp. Opin. Ther. 
Patents 5:469-476 (1995). DNA microchips contain oligonucleotide probes 
affixed to a solid substrate, and are useful for screening a large number of 
samples for gene expression. 

The present invention also further includes kits for diagnosing subtypes 
of acute leukemia, comprising a means for measuring gene expression of each 
gene of a cset which is herein disclosed as being correlated with a subtype of 
leukemia, wherein said means are within a container. In one embodiment, a kit 
is provided which comprises a means for measuring gene expression of LYN V- 
yes~l Yamaguchi sarcoma viral related oncogene homolog, a means for 
measuring gene expression of PPGB Protective protein for beta-galactosidase, 
and a means for measuring gene expression of Zyxin. In one embodiment, the 
means for measuring gene expression is a DNA microchip which contains probes 
specific for the target gene(s). In another embodiment, the means for measuring 
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gene expression is an antibody specific for the protein of interest. Other means 
for measuring gene expression are well known in the art. 

The invention also relates to therapies targeted at these indicator genes, 
as well as the screening of drugs for cancer that target these indicator genes or 
their protein products. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way 
of illustration and are not intended as limiting. 

Examples 

Example 1 

Those skilled in the art can, by the exercise of ordinary skill, measure the 
mRNA or protein level for each of the two, three or more (preferably two to six) 
genes in a correlated set discovered to be diagnostic for leukemia subtype and, 
in reference to a standard, classify new cases of leukemia with respect to subtype. 
Such an analysis would be highly amenable to modern diagnostic "chip" 
technology and suitable for incorporation into a bedside diagnostic device. 

For example, in reference to Appendix A, page a, cset 2, the expression 
level of Affymetrix designated genes LYN V-yes-1 Yamaguchi sarcoma viral 
related oncogene homolog (GenBank Accession #M16038)„ PPGB Protective 
protein for beta-galactosidase (galactosialidosis) (GenBank Accession #M22960), 
and Zyxin (GenBank Accession #X95735) is diagnostic of ALL. In this case, 
diagnosis of ALL can be made if the relative expression level of each of these 
genes is low. Similarly, other csets in Appendix A provide diagnostic gene 
"signatures" or "fingerprints" of similar value. 

Example 2 

Those skilled in the art can measure the mRNA or protein level for each 
of the genes in a correlated set discovered to be a prognostic indicator for AML, 
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and in reference to a standard, predict patient response to treatment. Such an 
analysis could be extremely valuable in designating patients as unlikely to 
respond to conventional therapy, and hence targeting them for more intensive or 
more experimental procedures. 

For example, in reference to Appendix C, cset 2, the expression level of 
genes 1436and3847 (Afifymetrix designated genes POU3F1 POU domain, class 
3, transcription factor 1, GenBank Accession No. L26494_at; and GB DEF » 
homeodomain protein HoxA9 mRNA, GenBank Accession No. U82759_at, 
respectively) is a prognostic indicator for AML. In this case, AML prognosis is 
good if the relative expression level of these genes is medium-high and high, 
respectively. 

Example 3 

Total RNA is extracted from tissue samples of a patient with leukemia, 
and cDNA is prepared using methods well known in the art. Double-stranded 
DNA is made from the cDNA. The double-stranded cDNA is transcribed using 
the Ambion T7 MegaScript Kit. The cRNA made from the in vitro-translation 
of the double-stranded cDN A is fragmented by adding 1 5 \ig cRNA to 0.2 vol of 
5X fragmentation buffer and storing at 95 °C for 35 minutes. The fragmented 
cRNA is then added to 3 uL 5 nM Control Oligonucleotide B2 (Final 
concentration: 50 pM)(Affymetrix); 3 uL 10 mg/ml Herring Sperm DNA ( Final 
concentration: 0.1 mg/ml)(Promega/Fisher Scientific); 3 uL50mg/ml Acetylated 
BS A (Final concentration: 0.5 mg/ml)(Gibco BRL Life Technologies); 1 50 ul 2X 
MES Hybridization Buffer (Final concentration: IX). The volume is adjusted 
with DEPC HjO to 300 uL total volume. 

A 1 2X MES Stock buffer is prepared: 70.4 g MES free acid monohydrate 
(Final concentration: 1. 22 MMES)(Sigma Chemicals); 193.3 g MES sodium salt 
(Final concentration: 0.89M [Na+])(Sigma Chemicals); 800 ml DEPC H 2 0; the 
volume is brought up with water to 1000 ml. pH should be between 6.5 and 6.7. 
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A DNA microchip, containing probes for LYN V-yes-1 Yamaguchi 
sarcoma viral related oncogene homolog, PPGB Protective protein for beta- 
galactosidase, and Zyxin, is prepared using, for example, the methods described 
in U.S. Patent No. 5,744,305, which is herein incorporated by reference. The 
microchip is equilibrated to room temperature just before use. The chips are pre- 
wet with 200 uL of 1 X MES Hybridization buffer at 45°C for 1 0-20 minutes, 60 
RPM. The fragmented cRNA is heated at 99°C for 5 minutes and cooled at 45°C 
for 5 minutes, then spun at maximum speed for 5 minutes. The IX MES 
hybridization buffer is removed from chips, and 200 j^l fragmented cRNA is 
added to each chip. The chips are incubated at 45 °C, 60 RPM for 1 6 hours. After 
16 hour hybridization, the cRNA is removed from the chip and stored at -80° C. 

For each chip: 1200 uLSAPE (Streptavidin Phycoerythrin) Solution is 
prepared, using 600 uL 2X Stain buffer, 120 uL 20 mg/mL Acetylated BSA 
(Final concentration: 2 mg/mL); 12 uL 1 mg/mL SAPE (Final Concentration: 1 0 
ug/mL)(Molecular Probes); 468 vL DEPC H 2 0. 600 uL Antibody Solution is 
prepared, using: 300 uL 2X Stain Buffer, 60 uL 20 mg/mL Acetylated BSA 
(Final concentration: 2mg/mL); 30 uL goat serum (Final concentration: 
5%)(Sigma Chemical); 3.6 uL 0.5 mg/mL biotinylated anti-streptavidin antibody 
(Final concentration: 3 ug/mL)(Vector Laboratories); and 206.4 uL DEPC H 2 0. 

2X Stain buffer is prepared using 41 .7 ml 12X MES Stock Buffer (Final 
concentration: 100 mM MES); 92.5 ml 5 M NaCl (Final concentration: 1 M 
[Na+] ); 2.5 ml 10% Tween 20 (Final concentration: 0.05% Tween); 112.8 ml 
DEPC H 2 0; filtering through a 0.2 urn filter; after filtering, add 0.5 ml of 5% 
Antifoam. 

Hybridization is performed using the Affymetrix GeneChip© Fluidics 
Station 400 at 1 0 cycles of 2 mixes per cycle with Non-Stringent Wash Buffer at 
25°C; 4 cycles of 1 5 mixes per cycle with Stringent Wash Buffer at 50°G; probe 
is stained with the first aliquot of the SAPE solution for 10 minutes at 25°C; 10 
cycles of 4 mixes per cycle at 2°C; probe is stained in antibody solution for 1 0 
minutes at 25°C; probe is stained with the second aliquot of SAPE for 10 minutes 
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at 25°C; final wash is 15 cycles of 4 mixes per cycles at 30°C; holds at 25°C. 
The plates are scanned using the Hewlett-Packard GeneAnay© Scanner 
(Affymetrix). 

Example 4 

Those skilled in the art can, by the exercise of ordinary skill, measure the 
mRNA or protein level for each of the two, three or more (preferably two to six) 
in a correlated set discovered to be diagnostic for leukemia subtype and, in 
reference to a standard, classify new cases of leukemia with respect to subtype. 
Such an analysis would be highly amenable to modern diagnostic "chip" 
technology and suitable for incorporation into a bedside diagnostic device. 

For example, in reference to Appendix A, page i, cset 1 for AML, the 
expression level of Affymetrix designated genes Zyxin (GenBank Accession 
#X95735_at) and . ELA2 Elastase 2, neutrophil (GenBank Accession 
#M27783_at) is diagnostic of AML. In this case, diagnosis of AML can be made 
if the relative expression level of each of these genes is high. Similarly, other 
csets in Appendix A provide diagnostic gene "signatures" or "fingerprints" of 
similar value. . 
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It will be clear that the invention may be practiced otherwise than as 
particularly described in the foregoing description and examples. 

Numerous modifications and variations of the present invention are 
possible in light of the above teachings and, therefore, within the scope of the 
appended claims, the invention may be practiced otherwise than as particularly 
described. 

The entire disclosure of all publications (including patents, patent 
applications, journal articles, databases, GenBank entries, web sites, laboratory 
manuals, books, or other documents) cited herein are hereby incorporated by 
reference. 
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Appendix A 



ALL Predictors 

Clustered Raw Data 



Matthews 
Relation 

0.8094 
0.9077 

0.8813 



0.8774 

0.8774 
0.8768 

0.8788 

0.8768 

0.8629 

0.8620 
0.8820 

0.8488 
0.8486 



0.8486 

0.8486 
0.8462 
0.8458 

0.8458 



Observed Association 



4 5 ALL 
46ALL 

44 ALL 

46ALL 

45ALL 
48 ALL 

48 ALL 

46 ALL 

42 ALL 

42 ALL 
42ALL 

44 ALL 
44 ALL 

44 ALL 

44 ALL 

46 ALL 
45ALL 

45ALL 



Velue:CQ«ne^320wtiereA^2405.82 B-3498,8 C-923.571 
Value* Gene:4847vyhereA*318.787 B=3397.48 
Value* Gene:1745 where A=«429.413793 B=22t 1.214286 
Value:B Gene:1829 where A*24$0.666667 8*522.245814 
Value* Gene:4847 where A^.117647 8=3703.809524 
Value* 6eno^288 where A=28.181818 B-708S.235294 
Value* Gene:3252 where A-101.470588 Bai682.O000OO 
Value:BGene3320 where A=2693.2352&4 B-906.063836 
Value* Gene:4647 where A=434,117647 B=3703 609524 
VelU8:C Qene:780 where A-8172.4 8-3964 0376.25 
Value* Gene:4847 where A-318.787 B=3397.48 
Value* Gane:4B47 where A-316.787 B-3307.48 
Value* Gene:4847 where A-434.1 17647 B=3703.809524 
Value* Gena:6919 where A*280.034483 8=1432.428571 
Value* Gone;i779 where A=884.650000 8=15238.916667 
Value* Gene:4847 where A*434.117647 8*3703.800524 
Value* Gene:2288 where A=28.181818 B=7065.235294 
Vahie*Gene;4847whereA=434.ii7647 B=3703.809524 
Value* Gefie:2121 where A=1 739.65 8=6935.94 
Value* Gene:3232 where A-52,0476 B-1536.46 C-169.333 
Value* Gene:3252 where A=5Z0478 B*1536.46 C=169 333 
Vaiue*Gene:32S2 where A=S2.0476 B=1536.46 01 69.333 
Velue:CGene^320 where A-2405.82 B-3496.8 C=023.571 
Value* Gene:l779 where A=884.850000 8-15238.918667 
Value* Gene:3262 where Ap101. 470588 BM662.000000 
Value* Gena;4847 where A=434. 11 7647 8=3703.809524 
Value* Gene:1882 where Ap770.250000 8=15876.000000 
Value* Gene:2288 where A=28, 161818 8=7085.235294 
Value* Gene:3252 where A-101.470588 B-1662.000000 
Value* Gene;4847 where A=434. 11 7647 8-3703.809524 
Value* Gene:6376 where A-188.475410 B=2425.818l82 
Value* Gene:2288 where A-28.1 818 18 8-7085.235294 
Value* Gene3252 where A= 101 -470588 B-1662.000000 
Value* Gene:4847 where A*434.1 17847 8=3703.809524 
Value* Gena:3252 where A* 101 ,470588 B-1662.000000 
Value* Gene:4847 where A=434,1 17647 B-3703.609524 
Value* Gene:2121 where A=1738.85 B-6935.04 
Value:CGene:3320 where A=2405.82 B=3496.8 C=923.571 
Value:B Gene:1829 where A-2450.66e667 B-622^45614 
Value* Ger*e:3252 where A-101^470588 B-1662.000000 
Values Gene.-3320 where A=2693.235294 B=906.963836 
Value* Gene:2288 where A»28. 181818 B-7085,235294 
Value* Gene:3252 where AM01. 470588 B*1682,000000 
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Matthewe Obeerved A>s elation 



0.8387 
0.8210 

0.8210 



0.8157 
0.8154 



0.8154 
0.8154 

0.8154 
0.8143 
0.8143 
0.8143 
0.8143 

0.8143 
0.8038 



41 ALL 
43ALL 

43 ALL 



46ALL 
40ALL 



40 ALL 
40AU 

40 ALL 
45ALL 
45 ALL 
45ALL 
45 ALL 

45 ALL 
41 ALL 



Value* Gerte:8803 where A=2Q25.78B885 B=1 0902. 181818 
Value* Gene:6806 where A-1858.393443 8=10826.818182 
Value* Gene;8Q4 where Ap3301. 48 8=10857 0=692.61 5 
Value* Gerw;3252 where A*52,0476 B-1538.48 C=169.333 
Value* Gene.2242 where A=44.150000 8=638,750000 
Value:BGene:3847 where A=887.588235 8=182.090009 
Value* Gene:4847 where A*434.1 17847 8=3703.809524 
Valued Gene;1828 where A=24S0.686687 B-522^45814 
Value* Gera:1834 where Af234.569322 B=1 245.538462 
Value* Gene:3252 where A=101 .470588 B=1 662.000000 
Value:BGene^320 where Ap2893.2352S4 B-906.963838 
Vaiue:&Gane:4499 where A=97£454545 8=209.032787 
Value* Gene:5683 where A-778J83638 B=2486.647059 
Value* Gene:4847 where A-434.1 17847 8=3703.808524 
Value* Gerc>:2121 where A»1739.65 8=6935.94 
Value* Gene:3252 where A=52.0476 8=1536.46 0169,333 
Value* Gene*647 where A=318.787 B^3M7.4flt 
Value* Gene:2128 where A=576.2 8=292.891 01277,12 
0-7459 

Value* Gene:4847 where Af318.787 B-3397.48 
Value* Gene:2363 where ApS22£B3 B=2712 
Value* Gene:3252 where A*52.0476 B-1538.48 0169.333 
Value* Gene:4847 where A=318.787 8=3397.46 
Value* C*ne:3252 where A-S2.0476 B-1S38.48 O160.333 
Value* Gene:4847 where A=318.787 B=3397.48 
Value* Gene:604 where A=3301 .48 8=10857 C-892.616 
Value* Gene:2121 where A=1739.65 B=6935,94 
Value* Gene:4847 where A=434.117647 8=3703.809524 
Value* Gene^201 where A=890.474576 B=13711.461538 
Value* Gene:4847 where A=434,1 17647 8=3703.809524 
Value* Gene:604l where A=851.929826 B-3705.800000 
ValurBGene:1829 where A=2450.666687 B=522^45814 
Value* Gene:1834 where A=234.559322 8=1246.538462 
Value* Gene;3252 where A-101. 470588 B-1882.000000 
Value* Gene:4366 where A-343.200909 8-2419.882353 
Value* Gene-4847 where *=434.1 17847 B=3703.809524 
Value* Gene:1834 where A=234.569322 B-1245.S38462 
Value* Gene*121 where A-1 946.135583 8*7997.384815 
Value* Geee:2288 where A=28.181816 8=7065.235294 
Value* Geae:3462 where Af-37.71 1884 B=67.384615 
Value*Gene:4198 where A= 1409.29 1687 8=7309^76000 
' Value* Gene:4847 Where A=434.117847 B-3703.808524 
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ALL Predictors 

Clustered Log Normalized Data 



Matthew* 
Relation 

0.9095 



Observed A*a elation 



47 ALL 



0.0013 

0.8B13 
08813 

0.8774 

0.8774 
0.B774 
0.8544 

0.8544 

0.8503 

0.8503 
0.8486 
0.8486 

0.8486 



44 ALL 

44 ALL 

44 ALL 

45ALL 

45 ALL 
45 ALL 
43 ALL 

43 ALL 

47 ALL 

47ALL 

44 ALL 
44 ALL 

44ALL 



Value* Gene:1779 where A=44681 10 8*4,834806 
Value* Gene:1882 where A-4482938 0=4.637279 
Value* Gene:2121 where A=4481 018 B=4.560041 
Value* Gefte:2288 where *=4463067 B=4.547863 
Value* Gene:2402 where A=4467723 B«4.633272 
Value* Gene:8376 where A=4.455837 B =4 .48 846 8 
Value* Gene:1615 where A=4462970 8-4488003 
Value* Gene:3482 where A=4452756 8=4454382 
ValueiA Gene:4847 where A=44S6925 8*4.504069 
Values Gene3320 where A*=44026O5 B-4486959 
Value* Gene:4847 where A=445B825 B=»4.5O4069 
Value* Gene:1745 where A-4.469803 B=4484955 
Value:BGene;3320 where Ap44026O6 8-4468059 
Value* Gene:4847 where A«4.458925 8=4,504069 
Value:A Owe: 1745 where A=4.4506O3 8=4.484955 
Value* Oene»88 where A=4.453687 8=4*47863 
Value* Gene:3258 where A-4470301 B=4.548814 
Value* Gene:4847 where A=4458925 B=4.504069 
Value* 0ene;1745 where A=4.459603 B-4.4840S5 
VaJue:BGene4499 where A*4467898 B=4.458513 
Value* Gene:4847 where A*4456926 8=4.504069 
Value* Gene:2288 where A«4453867 8*4.547863 
Value* Gene:3252 where A=4454877 8=4.477933 
ValueiB Oene:3320 where A=4.492605 8^4.468959 
Value* Gene:1778 where A=44681 10 8=4.634806 
Value* Gene:3252 where AM A S4B77 8=4-477933 
Value* Gene:4l90 where A*445310O 8=4.476172 
Value* Gene:5432 where A=4453427 B-4.455339 
Value* Gene:6201 where A=4463253 B=4.612053 
Value* Gene;3252 where A=4454877 B=4477933 
Valua:BGene:3320 where A=4492605 8=4.486959 
Value* Gene:4229 where A=4453830 B-44B9877 
Value* Gene:4847 where A=445B925 8=4.504066 
Vakie* Gene:8583 where A=4462051 8=4490602 
Value* Gene:1834 where A=4456900 8=4.471925 
Value* Gene2121 where AM481919 B=4.660041 
Value* Gene;2286 where A=4453667 8=4.547663 
Value* Gene:2280 where A-4463867 8-4.647853 
Value:B Gene:4499 where A=4467806 B=4456513 
Value:BGene:1820 where A-4488279 8-4460958 
Value* Gene:4847 where A-4458025 8*4.504069 
Value* Gene4288 where A«4453667 B=4.547863 
Value* GeneJ252 where A-4.454877 B=4477933 
Valued Gane;3320 where A=4492805 8=4466959 
Value* Gene:5833 where A=4450538 8-4.463545 
Value* Gene:2286 where A-44S3687 8=4.547863 
Value* Gene:4847 where A=4458825 8=4.504069 
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Matthew* 
Relation 



0,6462 

0.8462 
0.8462 

0.8458 

0.8458 
0.8458 



Observed Ass clatton 



0.8458 

0.8456 
0.84GB 
0.8458 

0.8458 

0.6458 
0.8458 



Value* Gen*6201 where A=4463253 B=4,612053 
46ALL Value* Gene:1834 where A=4,458800 6=4.471925 

Value* 0ene:1882 where A=4.462936 B=4.637279 
Value* Gene:3320 where A=4.4926Q5 6-4.486959 
Value* Gene:6803 where A*4.481823 6=4.588492 
Value* Cene:8806 where A-4.479556 8-4.587092 
46 ALL Value* Gene:2288 where Af4453687 6*4.547863 

VaIue:B Gene.3320 where A=4,492605 B-4.4Q6959 
46 ALL Value:BGene:1829 where A=4,488278 B-4.460956 

Value* Gene:1834 where A=4456800 8-4.471925 
Value* GeiNK2288 where A*4.453867 6=4.547883 

45 ALL Valuo:B Gena:1829 where A=4.488279 6-4.460958 

Value* Gene:1 834 where A«4,4569Q0 B^4.471025 
Value* Gene:2288 where A*4463867 B*4.547863 
Value* Gene:5633 where Ar4.450658 8=4.463545 
Value* Gene:6919 where A-4.4575B4 B=4.474843 
45 ALL Value:B Gene:1829 where A=4.488279 B-4.460958 

Value* Gene:2288 where A=4453687 B=4.547863 
Value* Genetfies where A~4.465723 6-4.524227 

46 ALL Value* Gene:18B2 where A=4.462938 6=4.637279 

Value* Gene:2288 where A=4.453667 6*4.547863 
Value* Gene:2585 where A=4*5S314 6-4.463555 
Value* Gena^262 Where AP4.454877 6=4.477933 
Value* Gene:4229 where A-4.453830 B=4.489877 
Value* GeneOT87 where A-4482005 B-4.586722 
Value* Gene:8803 where AM.481823 6=4.588492 
Value* Gene:6808 where A=4w479556 6=4.587092 
Value* Gene:8919 wham A=4.4575B4 B=4.474643 
46 ALL Value* Gene£288 where A=4>I53B87 B-4.547863 

Value* Gene2402 where A=4.467723 B=4.633272 
Value* Gene:4647 where A-4 468925 6-4.504089 
45 ALL Value* Gene:2288 where A=4.453867 6=4.547863 

Value* Gene:4847 where A=4.458925 B«=4^04069 
45 ALL Value* Gene:4847 where A-4.45B825 6=4.504069 

Value* Gene:6819 where A=4.457584 6=4.474643 
45 ALL Value* Gene:1882 where A=4.482936 B=4.637279 

Value* Gene:2288 where Af4.453687 B-4.647883 
Value* Gene:3252 where A=4454877 8=4.477933 
45 AU Value* Gene:1882 where A*4<462936 B=4.63727G 

Value* Gene4288 where A=4.453667 B=4.547803 
Value* Gene:3252 where A=4 .454877 6*4.477933 
Value* Gene:6803 where A=4,481823 6=4,588492 
Vatu«*Gena:6809 where A-4.478556 B-4.587092 
45 ALL Value* Gene:1882 where A-4.462838 B-4.637279 

Value* Gene:4847 where A-4.A58925 B-4.504069 
45 AU value* Gene:1882 where A-4.462936 6-4.637279 

Value* Gene:4647 where A=4*58925 B=4.504069 
Value* Gene:6797 where A-4.482005 6=4.588722 
Value* Gene;8803 where A-4481823 6-4.588492 
Value* Gene:6806 where A^4»479556 6=4.587092 
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Matthowt 
Relation 

0.8458 



0,8456 
0.8456 

0.82S6 

0.8210 
0.6210 

0.8210 

0.6210 

0.8210 

0.6210 
0.8162 

0.6162 

0.8162 

0.6162 



Observed Am elation 



45ALL 

45ALL 
45ALL 

42ALL 

43ALL 
43 ALL 

43 ALL 

43ALL 

43 ALL 

43 ALL 
44ALL 

44 ALL 
44 ALL 
44 ALL 



Valued Geoo:2268 where A-4.453667 B-4.647863 
Valued 66ne:4647 where A=4.458925 8=4.504069 
Value* Gene:69t9 where A=4.457584 B=4.474643 
Value* Gene;3252 where A=4.454877 8*4.477833 
Value:B Gene:3320 where A=4.492605 8=4.488959 
Value* Gene:4847 where A=4.458926 6=4.504069 
Value* Gene:8797 where A-4.462005 B*4.586722 
Value* Gene:6803 where A=4.481823 8=4.586492 
Value* Gene£806 where AM.470566 8^4.587092 
Value:BGene:1829 where A-4.488279 B=4.460958 
Value* Gene:1834 where A=4.4S6600 B=4.471825 
Value* Gene:3183 where A=4.480033 6*4.607884 
Value* Gene:3320 where A-4492805 B»4.466959 
Value* Gene:4377 where A-4 .463571 B=4.492668 
Value* Gene'2288 where A-4.463867 B=4.547863 
Value* Gene:3252 where A^4.454877 B-4.477933 
Value* Gene:4647 where A»4.4S8925 8=4.504069 
Value* Gene2288 where A=4.453667 B-4 347663 
Value* Gena3252 where A-4454677 B-4.477933 
Value* Gene:4847 where A«4.458925 B=4.504069 
Value* GonetfMI where A=4.463123 6*4.506267 
Value* Geee2288 where A=4.453667 B«L547863 
Value* Genet3252 where A=4.454877 B-4.477933 
Value* Gerw4847 where A»4.458926 8=4.504069 
Value* Gene;6919 where A=4.457564 B-4.474643 
Value* Gene:2363 where A=4.460906 B-4.491798 
Value* Gene:3252 where A=4.454877 B-4.477633 
Value* Gene;4366 where A-4456506 8=4.466664 
Value* Gena:4647 where A=4.45892S 8*4.504069 
Value* Gene;3252 where A=4.454877 8=4.477933 
Value* Gene:4366 where A-4.458506 B-4.488664 
Value* Gene:4847 where A=4458925 B-4.504089 
Value* Gene:3252 where A-4.4S4877 B°4.477933 
Value* Gene;4647 where A=4.458925 8=4.504069 
Value* Gene:1745 where A=4.459603 8=4-484955 
Value* Gene:2288 where A=4.453687 B=4.547863 
Value* Gene:4229 where A=4^53630 B=4.469877 
Valuo:BGene:448e where A«4<487896 B-4.456513 
Value* Gene£280 where A-4.461424 8=4.480763 
Value* Gene:1745 where A-4.459603 8=4.484855 
Value* Gene2286 where A=4.453667 B=4.547863 
Value* Gene:4847 where A>*M66925 B-4.504069 
Value* Gene:5633 where A=4.4S0556 B=4.46354S 
Value* Gene:6919 where A-4,467684 B-4.474643 
Value* Gene;3252 where A=4.454877 B=4.477933 
Value* Gerra£005 where A*4.462043 B«4.460720 
Value* Gene«803 where A=4.461823 B=4.588482 
Value* Gene:6806 where A=4.479556 8=4.567062 
Value* Gene:6910 where A=-4.457584 B-4.474843 
Value* Ger>e:1280 where A-4.457738 B«4,4S4284 
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Matthew* observed Aaaodat! 
Re tatl n 



0.8162 
0.8162 

0.8162 

0.8162 
0.8162 

0.8162 

0.8162 
0.8157 



0.8143 
0.8143 



0.8143 
0.8143 
0.8143 
0.8038 
0.8036 
0.8038 

O.6038 



44 ALL 
44 ALL 

44 ALL 

44 ALL 

44 ALL 

44 ALL 

44 ALL 
46 ALL 



46ALL 
45 ALL 



45 ALL 
45 ALL 
45ALL 
41 ALL 
41 ALL 
41 ALL 

41 ALL 



Value* Gene:2288 where A-4.453667 6=4.547863 
Value* Gene:4847 where A-4.458926 6=4.504069 
Value* Gene:1615 where A-4.462970 6-4.488003 
Value* Gene:4847 where A=4.4S8925 8=4.504069 
Valua:B Gene: 1829 where A-4,488279 B=4 .460958 
Value* Gene2242 Where A=4.4S4008 6=4.461486 
Value* Gene:6201 where A=4.463253 6=4.812053 
Value* Gena:6584 where A-4.458951 8*4.474055 
Value* Gene-2288 where A=4.453667 6=4.547863 
Value* Gei»:3252 where A-4.454877 6-4.477933 
Value* Gene£833 where A=4.460558 6=4.463545 
Value* Gene:6041 where A=4 .463123 6=4.506267 
Value* Gene:2288 Where A=4>*53667 8=4.547883 
Value* Geiw4847 Where A-4.458025 8*4.504069 
Value* Gene:5833 where A-4.460558 6=4.463545 
Value* Gene2288 where A=4.453867 6=4.547663 
Value* Gene:4B47 where A=4,458925 6-4.504069 
Value* Gen*5833 where A-4.460558 6=4.463545 
Value* Gene:6S18 where A-4<457584 6=4.474643 
Value* Gone4847 where A«44S8925 8=4.504069 
Value* Gene.6185 where A^4.485723 6=4.524227 
Value* Gene:6919 where A-4.457564 6=4.474643 
Value* Gene*647 where A=4,458925 6-4-604069 
Value* Gene:6201 where A-4.463253 6=4.612053 
Value* Gene-,1779 where A=44661 10 6=4.634806 
Value* Gene:2268 where A=4.453667 6=4.547863 
Value* Gene:5833 where A=4.450558 6-4.463545 
Value* Gene:4847 where A-4.456925 8=4.504099 
Value* Gene:1280 where A=4.46TO9 6=4454284 
Value* Gene:1400 where A=4.4701 68 8=4.545801 
Value* Gene:2137 where A=4.451 155 6=4.461070 
Value* Gene^268 where AP4453687 6-4.547663 
Value* Gene:4368 where A-4.456506 6=4.486664 
Value* Gene;604 1 where A=4.4631 23 6=4.508267 
Value* Gene;1745 where A=4.459603 6=4.484955 
Value* Gene:4847 where A-4.458925 6=4.504069 
Value* Gene:2121 where A=4j4B1919 B-4.580041 
Value* Gene*847wlHjreA=4 458925 6=4.504069 
Value* Gene:2288 Where A=4.453667 6=4347863 
Value* Gene:3252 where A=4<454677 6=4-477933 
Value:8Gene:997 where A=4.455181 6*4.451743 
Value* Gene:3252 where A^4.454877 8=4,477933 
Value* Gan*2111 where A=4.47195B 6=4.500438 
Value* Gene2252 where A=4.454877 8=4.477933 
Value* Getur.2121 where A=4.481919 B-4-S60041 
V^ue*Gene^288 Where AM-453887 6-4,647863 
Value* Gene:4847 where A=4.458925 6=4.604069 
Value* Gene:5107 where A=4-4S3589 8-4.455868 

Valued Gene.-997 where A-4.4551B1 6=4.451743 
Value* Gene:4647 where A-4.458925 6=4.504069 
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Metthew* Observed Aseocleti n 
ft latl n 

0.8038 41ALL 
0.8038 41 ALL 



VatueiB Gene:1539 where A=4.458918 6=4,464273 
ValuorA Gene:3258 where A=4.479301 B*4.54$814 
Valued Gene:4847 where A=4.458825 6=4.504080 
ValueiA Gene:1745 where A*4.459803 6=4.484955 
Value* Gene^548 where A=4,489232 6*4.499254 
Valued Gene:3252 where A=4.454877 B-4,477933 
Va!ue:BGene:4499 where A=4«487898 B-4.456513 
Ve!ue:BGene:6141 where A*4.473292 6*4.460415 
Value:B Gene:8373 where A-4.481822 B-4.481275 
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AML Predl tors 

Clustered Raw Data 

Matthews Observed Association 
Relation 



0.9095 

0.6788 

0.8774 
0.8768 

0.8503 



0.8503 

0.8503 
0.8503 

0.8503 

0.8503 
0.8503 

0.8482 

0.8482 

0.8458 

0.8458 
0.8458 

O.8210 
0.8157 



22 AML Value:B Gene;4B47 whore A-318.787 6=3397.48 

Vaiue:D Gene:6218 Where A-7.38462 B-157.5 C-136.158 
0=4382.71 E=43 

21 AML Vatue:B Gener3252 where A=S2.0476 B= 1536.46 0 169.333 

Value:B Gene:4847 where A^318.787 B=3397.48 
23 AML Value:B Gene>4847 where Ap318.787 6*3397.48 

Va!ue:BGene:4328 where A-46G3.05 8=1128.18 C=99 

22 AML D-10585 

Vaiue:BGene:4847 where A-318J87 B=3397.48 
20 AML Value:CGena:1144 where A*883.6 B=2760 0238.463 

Vatue:EGer*:2288 where A=118.126 B-590 C-161,834 
0=19586 E-5447.25 

VetuerB Gene:3252 where A=52.0478 B=1S38.48 C= 169.333 
Valued Gene:4847 where A=31 8.787 8=3397.48 
Value:0Gerie:172S where A*-116 B-16.439 C=1214 
20 AML O250.2D7 

Vetue^ Genei4328 where A=4603.05 B= 1128.16 089 
D-10585 

Value* Gene;4847 where A-318.787 B=3387.48 
ValuerE Gene!2288 where A=»1 19.125 B=-590 0-161 .634 

20 AML 0=19668 £=5447.25 

Value:E Gen*2288 where A-119.125 B=-590 0-181,634 

20 AML D*19568 5=5447.25 

VafuetB Gene32&2 where A=52^476 B-1536.48 0160,333 
Value* Gene:228e where A«119.125 B=-S90 O-161-034 

20 AML D=1956fl E-5447.25 

Value:B Gene;3252 where A=52.0476 B-1538.46 O160.333 
Valued Gene:4847 where A*318.787 &c>3397.4B 
Valued Gene^208 where A=119.125 B=-590 0-161.634 

20 AML D-19568 E=5447.25 

Vaiue:BGene:4847 where A=318.787 8=3397,48 
20 AML Value:B Gene:3252 where A*52.0478 B-1636.46 0169.333 

Value:BGene:4326 where A»4603.05 8=1128.16 099 
D»10S85 

Vahie:B Geoe:4847 where A=31 8.787 B-3397.48 
Value* Gene:758 where A-84 B-1487.36 04015.33 

21 AML 0*337.5 E-7997.44 F*-85.6667 

Vahie:B Gene:4847 where A=318.787 B=3397.48 
21 AML Vatue;BGone-.3252 where A*52.0476 8=1538.48 0188.333 

Valuefi Gene>4328 where A=4603.05 B=1 128.16 C=99 

0=10565 

22AML Value:C Gene: 1902 where A=2046 B*=2258 0-89.2821 

Volue:BGene:3252 where A«52,0476 B=1538.46 0 169.333 
22 AML Value:B Gen*3252 where A=52.M78 B-1536.46 C=169.333 

22 AML Value* Gene*l96 where A-6549.6 B=1 109.4 

Vatue:BGene:4328 where A-4603.05 B=1 128.18 C=99 
0-10565 

Value:E Gene;1 779 where A--74.5 B=-257 01043.27 

19 AML D-212.583 6=10030.4 

Vahiefi Gene:3252 where A-52.0478 B-1536.46 0=160.333 
Value:OGene:l725 where A=-116 B*16.439 01214 

20 AML 0*250.207 
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Matthews Obs rv«d Ait elation 
Relation 

Valoo:B Gene:4847 whom A*318.787 B*3397.48 
20AML Value:8Cene:4847whar9A=434.l17e47 B-3703.809524 



AML Predictors 

Clustered Log Normalized Data 
Matthews Observed Association 
Relation 

0.8143 21 AML Value* Geno:4847 where A-4.458925 8=4.504069 



A281-3lappA.doc 
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Gene 
Index 


Genbaokor 
Aflymctrtx 
Accession Number 
with Submission 
Date 


Gene Description 


Referenee 


400 


D38S48 
17-OCM994 


KlAA0076gene 

• 


Nomurs,N. « aL 1994, Prediction of the 
coding sequences of unidentified human 
genes. II. The coding sequences of 40 
new gene* (KiAA0041-KIAA0080) 
deduced by analysis ofcDNA clones 
from human cell line KG- 1 . DNA Ha. 
1.223-229(1994) 


720 


D87449 
27-AUG-J996 


KlAA0260gene, 
partial cds 


Nagasc,T. ei aL 1996. Prediction of the 
coding sequences of unidentified human 
genes. VL Tho coding sequences of 80 
new genes (KIAA0201-K1AA0280) 
deduced by analysis of cDNA clones 
from cell line KG*l and brain- DNA 
to. 3,321-329. 




D88270 
02OCT-I996 


DNA for 

immunoglobin lambda 
light chain 


KawasakiJC. ef al. 1997. One-megabaso 
sequence analysis of the human 
isDnunoglobulm lambda gene locus. 
Genome Rex. 7. 250461 (1997) 




D88422 
J5-OCT-I996 


CYSTATIN A 


YamazakJyM. es aL 1997. Genomic 
structure of human cystatra A. DNA 
5^.8,71-76, 


804 




Affyrnetrix. Santa Chra CA 


997 


HG432l-HT4591_at 


AhnakRebaed 
Sequence 


AftymemX Santa Oara CA 


1144 


J05243 
I2-DEOI989 


SrTANI Spectrin, 
alpha, non-erythrocytic 
1 (alpha-f odrin) 


MoonJLT, and McMahonAP- 1990. 
Generation of diversity in nonerymroid 
spectrins. Multiple polypeptides axe 
predicted by sequence analysis of 
cDN As encompassing the coding region 
of human «mcrythroid alpha>*pectrin. 
J. Biol Chrm, 265. 4427-4433, 


1260 


LQ9717 


LAMP2 Lywsoroo- 
assocl&ted membrane 
protein 2 (alternative 
products) 


Fukuda^M at of. 1988. Cloning of 
cDNAa eneodinn human hMOfiomal 
membrane glycoproteins, h*temp-I and 
h4arop-2. 0>mpErison of their deduced 
amino acid sequences. J, Biot. Chem. 
263 J 8920-18928. 

SawadaJL ef at l993.Tho genes of 
major lysosomal membrane 
glycoproteins, lamp-1 and laxrxp-2. 5*- 
flanJdng sequence of lamp-2 gene and 
comparison of ex on organization in two 
genes. 1 BioL Oam. 268. 9014-9022. 
Enawm: JBlol Ckem 268. 13010. 


1385 


120348 


Oncomodulingeae 


Fohr.U.G. et at 1993. Human alpha and 
beta parvolbumms. Structure and tissue- 
specific expression. Eur. J. Btochtnu 
215,719-727. 
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Gene 
Index 


Geo bank or 
Affymctrtx 
Accession Number 
with Submission 
Date 


Gene Description 


Reference 


1400 


L21954 ~ 


PBRIPHERAL-TYPB 
BENZODIAZEPINE 
RECEPTOR 


LwJ>. a at. 1993. Hie human 
peripheral benzodiazepine reeeptor 
gene: cloning and characpemarion of 
alternative splicing in normal tissues 
and in a patient with connoiital I teoid 
adrenal hyperplasia. Genomics IS, 643* 


1436 


L26494 


POU3F1 POU domain 
class 3, transenpbon 
factor 1 


FausJ., H80.HJ. and Fochs,B. 1994. 
Oct-6; a regulator of kemrtnocyte gene 
expression in stratified souainmia 
epithetta. Mot. CeiL Biol. 14, 3263- 


1539 


L38608 


ALCAM Activaod 
leucocyte cefl adhesion 
molecule 


Bowcn»M.A * oil 995. Cloning, 
♦napping, and characterization of 

molecule (ALCAM), a CD6 tigand. 7. 
£e* Aft* 1*1. 2213-2220. 


1615 


L42379 


Quleedn(Q6)inRNA, 
partiafede \ 


Oeo,C er n/.Moiecular cloning and 
expression of A novel bone-derived 
growth factor front a human 
osteosarcoma cell line* 


ms 


M14636 


PYGL Glycogen 
phosDhorvtue L fliver 
form) 


Ncwgard,C3. el aL (1986) Sequence 

matvttis Af tK#» r HWA dmrtwKm human 

liver glycogen phosphorytase reveals 
tissue-specinc codon usage. Proa Ncai 
Acad. Scl. USA. 03. 8132*8136. 


1745 


M1603B 


LYNV-ycH 
Yamaguchi sncoma 
viral related oncogene 
homolOB 


YananashifY. «r al (1987) Hie yes- 
related cdtular gene lyn encodes a 
possible tyrosine Idnase similar to 
oSdlck Mol Cell. BtoL 7. 237-243. 


1779 


Ml 9507 

23-NOV-1987 

ll-MAY-1988 


MPO Myeloperoxidase 


Yunadajtsf ol(1987). Isolation and 
characterization of a oDNA coding for 
human myeloperoxidase. Arch. 

Hashuiakajt er al (1988). Multiple 
species of myeloperoxidase messenger 
RNAs produced by alternative spiking 
and differential polyadenylation. 
Biochemistry 27, 5906V59 1 4. 
Erratum: Biochemistry 27. 9226. 


1829 


M22960 
13-JUL-1988 


PPGB Protective 
protein for beta- 
gelactosidese 
igalactosialldosto) 


GoUartaN J. er al (1988). Expression of 
cON A encoding me human 'protective 
protein' associated with rysomsomsl 
tan-galactosidsse and neuraminidase: 
Homology to yeast proteases. Cell 54, 
735-764. , 


1834 


M23I97 


CD33 CD33 antigen 
(diflerentiaiion antigen) 


ShnmonSaD. and Seed\B. (1988), 
Isolation of a cDNA encoding CD33, a 
diflciemiaiion antigen of myeloid 
progenitor cells. J. Immunol 141, 2797- 
2800. 



DOCID: <CA 2325806A1 J_> 



819 953 7620 



Gene 
Index 


Genbank or 
Afiymttrii 
Accession Number 
witt Submission 
Date 


Gene Desert prion 


Reference 


1882 


M2789I 
29-SEP-89 


CST3 Cy statin C 
( amyloid angiopathy 
end cerebral 
hemorrhage) 


Saiton^E ei a/.(1989}. The human 
cygrarin C gene (CST3) is a member of 
the cystatin gene family which la 
focalized on chromosome 20. Biochem. 
Blophvs. Jte. Onmmuh.161 1324-1331. 


1902 


M29474 
20-OCM989 


Recombination 
activating protein 
(RAG-naeno 


SehattAG. ei o/.(l989)Hie V(D)J 
recombination activating gene, Ra<M. 
Ce/iS*, 1035-1048. 


21U 


M62762 


ATP6C Vacuolar H+ 
ATPasc proton channel 
fiubunit 


6iUespk,G.A. at a/, (1991). CpG island 
in the region of an autosomal dominant 
polycystic kidney disease locus defines 
the 5* end of a gene encoding npoiative 
proton channel. Ptoc. NctiL Acad. ScL 
USA. 88. 4289-4293. 


2121 


M63138 


CTSDCatntpsmD 
(lysosomal &8panyi 
prOteasc) 


Redeoker.B. # a/. (1991). Molecular 
organxzation of the human cathepsin D 
uene. DNA Cell Biol. 10.423-431. 


212$ 


M63379 


CtU Clusterin 
(complement lysis 
inhibitory testosterone* 

-* mmoMM 

repressoQ provmic 
message 2; 
apolipo protein J) 


Wong,?. Hal (1993). Genomic 
organization and expression of the rat 
TRPM-2 (ciustertn) gene, & gene 

inmlfofittol in amaHmiIa. J Rtftt. Ohmn 
linptlWBSQ ID OfWpwJBia- 0Ivn wWi 

241.5021*5031. 

Wcmg,P. et al (1994). Molecular 
characterization of human TRPM- 
2/cIusterio, a gene associated with 
sperm maftemiott) epoptosw and 
neurtMkgenemiori Bar. X Blochatu 
221, 917-925. 


2137 


M63835 


HIOH AFFINITY 
IMMUNOGLOBULIN 
GAMMA FC 
RECEPTOR I "A 
FORM" PRECURSOR 


von de WinkeUGJ. ef «/. (1991). Gene 
organization of the human high affinity 
receptor for IgG, Fo-$amma4U (CD64): 
Characterization and evidence for a 
second gene. J, Bid. Chan. 266. 13449- 
13455. 


2242 


M802S4 


PBPTIDYL-PROLYL 

CIS-TRANS 

ISOMERASE. 

MITOCHONDRIAL 

PRECURSOR 


BcrgsmaJDJ. et al. (1991). The 1 
eyelophilin muhigene family of 
peptidyl-prolyi iaomeraaea. 
Characterization of three separate 
human isofonro. J. Biol Chan. 266, 
23204*23214. 


2288 


M84526 


DF D component of 


White,R.T.erdL (1992). Human 
adipsin is Identical to complement 
factor D and is expressed at high levels 
in adipose tissue, J. Biol. Ckem- 267, 
9210-9213. 


2363 


M93056 


LEUKOCYTE 

ELASTASB 

INHIBITOR 


RemoldODoimeil^E etal. (1992). 
Sequence and molecuUr 
characterization of human 
monocyte/neutrophil elastase inhibitor. 
Proe. Natl. Acad, ScL USA. 89, 5635- 
5639. 
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I Gene 
J Index 


1 Cenbank or 
I Affyinefrfc 
1 Accession Number 
I with Submission 
Date 




Reference 


2402 


M96326 


Aznrocidm gene 


Wr8aiMr.a^aA(l99I).aoiimgof " 
the cDNA Ibr the serine protease 
bomolog CAP37/nzurocidni t a 
microbicidal end chemotactic protein 
torn human granulocytes. J. Immunol. 
147.3210*3214. 

Zintmer^l eial (1992). Three human 
elastasc~likegenc* co-ordinately 
expressed in the myelo-monocyte 
lineage are organised as a single genetic 
locus on 1 9pttr. Proc Natl Acad. Set 
U.SJ. 89. 8215-821Q. 


2546 

I 


S82470 


opi**niaugnant 
expfgotfonhcahtftcod 
gene/tumor 
progression-enhanced 
^ene 


Fokwugs-Wufflon^N. m al (1996). ""I 
Molecular analysis of a gene. DDI , ] 
overexpressed m bladder and breast 
carcinoma. Anticancer Rts. 16. 1085* 1 
1090. | 


I 2800 


U00672 
10-AUG-I993 


iLiOK utcneukin 10 
receptor 


Liu,Y.€foJL (1994). Expression cloning I 
and enaroctcriarjon of a human 1L- 10 1 
receptor. ■/. Jnummo^ 1821-1829. j 


IliS 


U14971 
! 21-SEP-1994 


RP$9Ribosomal 

protein S9 
■ 


FngerioJJ* etaL (1995). Qomng, 
sequencing and cxpresuon of the L5. 
WI,U7a»L28,S5,S9,SI0andS29 
human ribosomsl protein mRNA*, I 
Bhchim- Biophv* Ada 1262» 64-68. 




U41635 ~ 
30-NOV-1995 


os^precuroeor 
mRNA 


Su,Y.A. «r al (1996> Complete 
sequence analysis of a gene (OS-9) I 
ubtquitoutly expressed in human tissues | 
sndampIiMnsarcornas.A/o/. 1 
Cordite*. 1$. 270475. J 


3252 I 
I 3258 


U46499 " 
1 8-JAN-l QQA 


GLUTATHIONES- 

TRANSFERASE, 

MICROSOMAL 


DeJcng^X. # al (1988). Qoic I 
expression of rat and human 1 
microsomal glutathione S-transfersscs. 
J. Biol Chan, 263, 8430-8436. 

KdnerJM J. er al (1996). Structural 
organisation of the human microsomal 
glutathione S-tronsfarase gene (OST1 2). 
Gmomta^ 100-103. 




U4675I " 
J9-JAN.I996 


WHttphoiyrosine 

independent Ugandp62 
ivrine ug*. om 
domain mRNA 


foun&L er o/.(1996). Molecular cloning 
of a phosphoryinsme^mdependenf 
Kgand of the p561ck SH2 domain. Proc 
Natl Acad, Sci. U.Sjt. 93. 5991-5995 


3320 


U50I36 

Z7-FEB-1996 , 


LeuJcotrieneC4 
synthase (LTC4S) gene 

i 
I 

1 MM 


Pawose^F. ct al (1996). Molecular " 
tloning of the gene for human 
cukofriene C4 synthase OreaniMtiAfi 
WfCleoTide sequence, and chromosomal 
localization to 5q35.7. Biol Chan. 271, 
I1356M1361. 


f 3482 ! 
! 1 


J603I9 i 
I0«JUN*1996 J 
) 


iLA-HMHCpmteln 
lLA.H(herediory 
UKmochrernamtls) i 

1 J 


FcdeM.N. et o/,(l996). A novel MHC 
Hase Mike gene is mutated in patients 
with hereditary hsemochnmuuosi& 
Vo*r* Gen*. 13. 399-408. 
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Index 


Afiymetrlx 
Accession Number 
with Summation 
Hate 


Gene Description 


Reference 


3525 


U63289 
08-JTJL-1996 


. RNA-binding protein 
(NAB50) mftNA 


Tuncbenko.LT etat. (1996). 
identification of a (CUG)n triplet repeat 
RNA*binding protein and its expression 
in myotonic dystrophy. Nucleic Acids 
Jfcuv 24. 4407-4414. 


3581 


U66580 
12-AUG-1996 


Putative G protein- 
coupled receptor 

/nDD'i i \ mama 
(OrlUlj gene 


0 > Dowd,B.F. <f 41 (1997). Cloning and 
chromosomal mapping of four putative 
novel human G-ptoteuvcoupled 


3820 


U8I554 

lrt.nPf\10Q/C 
1 U , UCV» 1 wo 


CaM kinase U isofonn 

fnRNA 


BrecnMA. and AsbcroftjS JJH. (1997). 
A truncated fsoform of 
CaWceftiwAiliivdcpcTtdcflt protein 
kinase D expressed in human islets of 
Langerhana may result from craaa- 
snUdiur. FEBSUst 409. 375-379. 




uoz /ay 

19-DEC-1996 


HoxA9 mRNA 


Rozenftld ( S < esal Human HOXA9 
homeobox cDNA sequence. 


4190 


X 16706 
30-OCT-1989 


FOS-RBLATED 
ANTIGEN 2 


Matsu^tf^l990). Ration of 

expression during monocyte* 
macrophage differentiation. Oncogene 
5.249*25$, 


4196 


X17042 
2MAN-1990 


PRG1 Proteoglycan I, 
secretory granule 


SteiIrecht,CM. and SaundeTs,G.F. 
(1989). Nucleotide sequence of a cDN A 
encoding a hemopoietic proteoglycan 
core protem. NucUicAdd* Re* 17, 
7523. 


4229 


X52056 
O7.MAR-W0 


SPll Spleen focus 
fonxuuog virus (SFPV) 
ptoviral int£gi*uon 
oncogene spil 


RayJD. ef oZ(l 990). The human 
borootogue of the putative proto- 
oncogene Spi-1: characterization and 
expression in tumors. Oncogene 5, 663- 
668. 


4322 


X59065 

IX A Bit • AAV 

16-APR-I991 


FGFt Fibroblast 
growth factor I 
(acidic) {(Utemative 
products) 


Wang,WJ>. et al (1991). Oomng and 
sequence analysis of the human acidic 
fibroblast growth factor gene and its 
preservation in leukemia patients. 
Otetfoie*. 1521-1529. 


4928 


VCA4 !■* 

08-MAY-1991 


FROTBASOMB IOTA 
CHAIN 


Bey,F. et at (1993). The prosomai 
RNA-bmdmg protem p27K is a member 
of the alpha-type human prosomai gene 
fcmiry. MoL Gem Genet. 237. 193*205. 


4366 


X61567 
25-SEP-1991 


ARHG Ras homolog 
gene family, member Q 


Vincent^ es al (1992). Growth- 
mguiared expression of ihoG, a new 

umiiVH VI UK 190 nuilfWlOg 

ftmttv. Ma/. Getf. iNof. 12. 3138-3148. 


4377 


X62654 
17-OCT-1991 


ME491 gene extracted 
from H .sapiens gene 
forMe49l/CD63 
antigen 


Hottaji ef at. (1992). Genomic 
structure of the ME491/CD63 antigen 
gene and functional analysis of the 5*- 
flanking regulatory sequences. 
Blochem. Biophys. Res. Ccmmun. 185, 
436-442. 
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Gerte~ 
Index 

"4455 


Genbaakor 
Affymetm 
j Accession Number 
with Submission 
Date 


Gene Description 


Reference 




X70297 ^~ 
04-PBB-I993 


tcctptor> nicotinic, 
alpha polypeptide 7 


ren&A. er ox (1994). Human alpha 7 
acetylcholine receptor, cloning of the 
alpha 7 eubunit from the SH-SY5Y cell 
line and detenmnstion of 
pnarmacologtcal properties of native 
receptors end functional alpha 7 
homomejB expressed in Xenopos 
oocytes. hfol Pharmacol 4$. 546*554. 


47*0 


X89066 
06WUU1995 


TRPCI TVansient 
receptor potcntisJ 
channel 1 


Wes.PI>. et al (1995). TRPCI, a 
human homo tog of a Oroaophlla store* 
operated channel. Proc. Natl. Acad. 
ScLUSjj. 92.9652.9636. 


"4M7 


X95735 
16-FBB-I996 


Zyxin 


ZuinbrumU. and Trueb.B« (1996). A 
ryxin-relatcd protein whose synthesis is 
reduced in virally tranefbimcd 
fibroblasts. £w. J. Biochtm. 241, 657- 
663. 


5107 


Z29067 
15-DEC-I993 


Nek3mRNAfcr 


Schuta^J. and Nigg^EA. (1993). 
Identification of 21 novel human 
protein kinases, including 3 members of 
a family related to the cell cycle 
regulator njmA of Aspergillus nldulans. 
Ceil Growth Differ. 4, 821-830. 

Schufr^SJ. at aJL(1994). Cell cycle- 
dependent expression of Nek2, a novel 
hmnan protein kinase related to the 
NIMA mitotic regulator of Aspergillus 
nlduiam Gtil Growth Differ. 5, 625- 
635. 


5175 


Z49269 
I8-MAY-I995 


ChemokineHCC-t 


PardigolA at al Kuoleotide Sequence 
of the Gene for the Human Chemokine 
HCCI. 
Unpublished 


5280 


J02783 
15-DEC-1988 


P4HR PmMliflovn. 

proline, 2-oxoglutarate 
44ioxygcnase (proline 
hydroxylase), beta 
polypeptide (protein 
disulfide wamerese; 
thyroid hormone 
bfomnj? protein oSS) 


Cheng^.Y. ar al (1987). The 
nucleotide sequence of a human cellular 
thyroid hormone binding protein 
present in endoplasmic reticulum. J. 
Biol. Cktm. 262, 1 1221-1 1227. 


5318 


L43576 


(clone EST02946) 

mWA 

May 6 1998 


Tuwns^Kt ef «/.< 1995). 130 kb of 
DNA sequence reveals two new genes 
and a regional duplication distal to the 
human iduroiute-2-sulfcte sulratase 
locus. Genomt Rts. 5 | 71-8. 
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[ Gene 
Index 


Gcobank or 
Aflymetrtx 
Accession Number 
with Submission 
Date 


GcseXkscrlptloft 


Reference 


5432 


U73936 
10OCM996 


Soluble protein Jogged 
mJWA, partial cds 


LtndseIl,CE etaL (1995). lagged; a 
mammalian llgsnd that activates 
Notchl.Ce//W> 909-917. 

UL- ct o/. (1997). Ahgflb syndrome is 
caused by mutations in human Jaggedl, 
which encodes a ligend tor Notch 1. 
Nature Genet. J6, 243-25 1 : 

UJL etaL (1998). The human homnlog 
of rat Jagged! expnased by marrow 
moras inhibits difitrennauon of 32D 
cells through interaction win Notch 1. 
Jnmi«ffya\43-S5. 


poB 


U197I3 
10-JAN-I995 


Allograft mflanamatovy 
fector-1 (AIP-1) 
mRNA 


Utaus,U at at (1996> Allograft 
fflflanunaXMy factory*!, A cytokine 
responsive macrophage molecule 
expressed in tranaplamed human hearts. 
Transplantation 61, 1387-1392. 


5833 


U05572 
25-JAN-1994 


MANBMannosidtte 
alpha-B (lysosomal) 


Neoes,VJL and Scomidt.M.C (1994). 
Human lysosomal alpba-mannosidase; 
isolation and audeetide sequence of the 
full-length cONA. Blachem. Blophyt. 
iter. Cbmmtm. 200, 239-245 

Btm1ianj,Ccr0t(l995). Partial 
sequeooe of the purified protein 
confirms the identity ofcDNA coding 
for human lysosomal alpha* 

363-366. ' 


5955 7 


U50327 


Protein kinase C 
substrate 80IC~H gene 
(PRKCSH) 


QphoflJLA. ef of. A3 Mb region for *" • 
meFHM locus on 19pl3 J-pl3.2; 
cxclurion of PRKCSH as a candidate 




M32304 


TlMP2TisKucinhibimr 
or metal kkpreielnBse 2 


Boonc.TC. tf el(l990). cONA cloning 
and expression of a inetaUoprotoinase 
inhibitor related to tissue inhibitor of 
ntetalloproteiiustfi. hx>c Natl. Acad. 

SgilSSfj^2grogro4. _ 


[5541 


L09209 "~ 


Arurz Axnytotd oeta 
(A4) precursor-like 
protein 2 


Sprecher,GA. etal. (1993). Molecular - 
Cloning of the cDNA for a Human 
Amyloid Precursor Protein Homolog 
(APPK), Biochemistry 32. 4481^486. 


6141 


Y0876S 
10-OCM996 


ZPfAX protein 
alternatively spliced 
product 


AmingpS. etaL (1996). Mammalian 
splicing factor SF1 is encoded by 
variant cDNAs and binds to RNA. RNA 
2,794-810. 


[6155 


X64072 
05-MAR-1992 


SELL Leukocyte 
adhesion protein beta 
submit 


wetaman^3. cx at (1991). The gene 
organisation of the human beta 2 
unegriu subunsr (CD18). FEES Lett, 
97-103. 
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| Gene 
I Index 

J 620 J 


Geabankor 
Afiymetrtx 

Y00787 

03-MAY-I988 


<^e Description 
PRECURSOR 


M**rtm*JL«al(im).hl lecular" 
1 eloniug of a human monocyte^dcrived 

neutrophfl chemotactic fector (MDNCF) 

and the mducrioa of MDNCF mRNA by 
j mterleukin 1 and tumor necrosis Actor. 

/ Exd. Med. 167. 1883-1*93, 




M27783 


ELA2 Blfistats«2. 
neutrophil 


j Farley J), tt al. (1988). Molecular 
I cloning ofhumm neutrophil elastaae. 
J Biol, Chm. fiBBBSSfeSS *" 7 - 1 


j 6373 


1*81695 


ITOAX inttgrin, alpha 
X (antigen CD 11C 
(p!50)> alpha 
polypeptide) 


1 Cort>i,AX, tt al. (1987). cDNA ckmingH 
and complete primary structure of the 1 
j alpha submit of o leukocyte adhesion I 
I glyevprateiii. pISO,95- EMBQJ. 6. 
1 4023-4(128. | 




M83652 


PFC Properdin P 
factor, complement 


I NobmJK-F. « oi: (1991). Molecular j 
1 c,oning . of coding for 
1 properdin, a positive regulator of the 1 
1 alternative pathway of human 
complement Eur. J. Immunol 21. 77K 
776. I 

Weilcr^M and MavesJCfC (1992). j 
Doectioo of properdin mRNAm 
human peripheral blood monocytes and 1 
spleen. J. Lab.ain.Med. 120,762- 1 
766. 1 

KinoaWta^ erol. (1992). A niembe7o7| 
me C/EBP family, NF-IL6 beta, forms a j 
hetfirodimerand transcriptionally | 
syncrgiaw with NF4L6, Proc Nail \ 
Acad. ScL U.S.A. 89. 1473-1476. 




M83667 


Nr-lLo-octs protein | 
mRNA 


[6502 


U31973 
2WUU.1995 


"HS^odfeTOwA 5 "! 

«ubunit(PDB6Q 

mRNA 


Firiev.Nl etal (1995). Gene structure 1 
and ammo acid sequence of the human 1 
cone photoreceptor ctiMP. | 
phosphodiesterase alpha' gubunit 1 
(PDBA2) and its chromosomal 
localization to I0q24, Genomics IB, 
429-135. 

Viczian^S. ei oJ. (1995). [solution end 
characterization of a cDNA encoding I 
me arpna summit of human cone cGMP- I 
phosphodiesterase. Gene 166.205-211. 1 


J 6563 


U5I333 
14-MAR-1996 


HK3 Hexoknusc 3 j 
(while cell) I 
1 

i 


FurutaJtertf^l996),S«|uenceof 
human hexokuuae 1I[ cDNA and 
assignment of the human hexokmase in 
jene (HK3) to chromosome band 
f q35.2 by fluorescence in situ 
bybridlzatvon, Genomics 36. 206*209. 


( 


£54367 1 
12-OCT-199S 


3B DBF -Plectra p 
1 i 
i 
< 

h 


Lm f CG. et of. (1996). Human plectin: 
wganizarion of the gene, sequence 
malysts, and chromosome localization 

Pfoe. Nail Acad. ScL USji. 
tt, 4278-4293. 1 
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Index 


Genbankar 
Aftymetrtt 
Accession Number 
with Submbxlon 


GeaeDotrtptfoo 


Reference 


6797 


J03801 
27-OCT-1988 


LYZLyBozyme 


QmgJLP. «r o/.(J98B). Cloning the 
bumtn Yytorym cDNA: inverted Alu 
repeal in the mRNA and In smi 
hybridization for microphones and 
Pane* cells. Proc Nat!. Acad. Set 
(7X4. 85.6227*6231. _ 


6803 


"Ml 904 


LYZLysozyme 


Yosfaumm^K. el al (1988). Human 
lysozymc sequencing of a cDNA, and 
expression and accretion by 
Saccbffomyces cerevixiAe, fltecA** 
Biopkvs. Res. Com/mm. ISO. 794-801. 


6806 


X14008 
I8-JAN-1989 


LywjzymcgracfEC 
3^.1.17) 


Petei*CW. el rt(1989). Ihe human 
lyeozyme gene. Sequence organization 
end chromosomal localization, Eur. J. 


6919 


XI6546 
18-SEP-1989 


RNS2 Ribonucleoae 2 
(cosmopliil-derivod 
neurotoxin; EDN) 


HammnJKJ. er<r£ (1990). Structure 
and chromosome localization of the 
human eosraophrt-demed neurotoxin 
and eosinophil canonic protein genes: 
evidence for intronless coding 
sequencee m the ribonuclease gene 
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What Is Claimed Is: 

1 . A method for diagnosing acute lymphoblastic leukemia (ALL), 
comprising: 

(a) measuring the levels of gene expression of leukotriene C4 
synthase (LTC4S) gene and Zyxin in a biological sample taken from a patient 
suspected of having ALL; and 

(b) comparing the levels of gene expression in said biological 
sample with a standard sample, wherein low levels of expression are indicative 
of a diagnosis of ALL. 

2. A method for diagnosing ALL, comprising: 

(a) measuring the levels of gene expression of LYN V-yes-1 
Yamaguchi sarcoma viral related oncogene homolog, PPGB Protective protein 
for beta-galactosidase, and Zyxin in a biological sample taken from a patient 
suspected of having ALL; and 

(b) comparing the levels of gene expression in said biological 
sample with a standard sample, wherein low levels of expression are indicative 
of a diagnosis of ALL. 

3. A method for determining a prognosis of a patient with AML, 
comprising: 

(a) measuring the levels of gene expression of POU3F1 POU 
domain, class 3, transcription factor 1 and GB DEF - homeodomain protein 
HoxA9 mRNA in a biological sample taken from a patient with AML; and 

(b) comparing the levels of gene expression in said biological 
sample with a standard sample, wherein medium-high levels of POU3F1 POU 
domain, class 3, transcription factor 1 and high levels of GB DEF = 
homeodomain protein HoxA9 mRNA, are indicative of a favorable prognosis. 
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4. A method for screening drags which are useful for treating acute 
leukemia, comprising: 

(a) administering to a cell culture a drug of interest; 

(b) comparing the levels of gene expression of leukotriene C4 
synthase (LTC4S) gene and/or Zyxin before administration of said drug with the 
levels of gene expression after administration of said drug, wherein a modulation 
of gene expression level after administration of the drug is indicative of a drug 
useful for treating acute leukemia 

5. A method for screening drugs which are useful for treating acute 
leukemia, comprising: 

(a) administering to a cell culture a drug of interest; and 

(b) comparing the levels of gene expression of LYNV-yes-1 
Yamaguchi sarcoma viral related oncogene homolog, PPGB Protective protein 
for beta-galactosidase, and/or Zyxin before adnunistration of said drug with the 
levels of gene expression after administration of said drug, wherein amodulation 
of gene expression level after administration of the drug is indicative of a drug 
useful for treating acute leukemia 

6. A kit for diagnosing ALL, comprising: 

(a) a means for measuring gene expression of leukotriene C4 
synthase (LTC4S) gene; and 

(b) a means for measuring gene expression of Zyxin. 

7. A kit for diagnosing ALL, comprising: 

(a) a means for measuring gene expression of LYN V-yes-1 
Yamaguchi sarcoma viral related oncogene homolog; 

(b) a means for measuring gene expression of PPGB 
Protective protein for beta-galactosidase; and 

(c) a means for measuring gene expression of Zyxin. 
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8. A method for screening drugs which are useful for treating acute 
leukemia, comprising: 

(a) administering to a cell culture a drug of interest; and 

(b) comparing the levels of gene expression of POU3F1 POU 
domain, class 3, transcription factor 1 and/or GB DEF = homeodomain protein 
HoxA9 mRNA in a biological sample taken from a patient with acute leukemia, 
wherein a modulation of gene expression level after administration of the drug 
is indicative of a drug useful for treating acute leukemia. 

9. The use of gene expression levels of leukotriene C4 synthase 
(LTC4S) gene and Zyxin to diagnose ALL, 

10. The use of gene expression levels of LYN V-yes-1 Yamaguchi 
sarcoma viral related oncogene homolog, PPGB Protective protein for beta- 
galactosidase, and Zyxin to diagnose ALL. 

11. The use of gene expression levels ofPOU3Fl POU domain, class 
3, transcription factor I and GB DEF = homeodomain protein HoxA9 mRNA for 
the prognosis of AML. 

12. A method for diagnosing acute myeloid leukemia (AML), 
comprising: 

(a) measuring the levels of gene expression of Zyxin and 
ELA2 Elastase 2, neutrophil, in a biological sample taken from a patient 
suspected of having AML; and 

(b) comparing the levels of gene expression in said biological 
sample with a standard sample, wherein high levels of expression are indicative 
of a diagnosis of AML. 
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Methods for the Diagnosis and Prognosis of Acute 

Leukemias 

Abstract 

The present invention relates to the diagnosis of the distinction between 
acute lymphoblastic leukemia (ALL) and acute myeloid leukemia (AML) and 
prognosis of AML. Disclosed is a means to diagnose the distinction between 
ALL and AML employing measurement of the abundance of the nucleic acid or 
protein products of small combinations (two, three or more) of particular human 
genes. The invention further describes the use of the measurement of the 
abundance of the nucleic acid or protein product of two human genes for 
prognostic indication in AML. The invention also relates to therapies targeted at 
these indicator genes, and the screening of drugs for cancer that target these 
indicator genes or their protein products. 
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